Copula-Momentum Strategy: Quantitative Equity Trading

Author

Brayden Boyko & Shinto Kai

Published

April 9, 2025

Registered S3 method overwritten by 'tsibble':
  method               from 
  as_tibble.grouped_df dplyr

The Copula-Momentum strategy introduces a systematic trading approach that merges crash risk modeling with a momentum overlay. The approach leverages copula functions to select ETF pairs that exhibit low joint crash probabilities in volatility, and uses a Kalman-filter to mechanically approach momentum as the driver of position entry. All trades are capital-constrained, with risk control enforced via volatility-based position sizing. The result is a consistent, risk-managed strategy that seeks to exploit behavioral inefficiencies in the United States equities market.

Summary of Results

The following analysis highlights the robust performance observed in the strategy’s backtest results. The cumulative return chart reveals consistent outperformance compared to the SPY benchmark throughout the backtest period. However, this superior performance is crippled by realistic transaction costs. This is due to constant trading, as a result of extremely quick reversions in volatility momentum.

Complementing this, the positions component of the visual, presents a heatmap of consecutive holding periods by ETF product, demonstrates the portfolio’s diversified pair rotation and the dynamic nature of trade durations. The analysis shows that most pairs are held for relatively short periods, often under 15 days. This pattern aligns with the expected behavior of the Ornstein-Uhlenbeck process, further confirming the efficacy of the momentum overlay in capturing short-term divergences.

Adding to these findings, the strategy exhibits adaptability across various market conditions. The active rotation across sector pairs, combined with high turnover efficiency, underscores its dynamic nature. The strong performance appears to be driven by the strategic selection of pairs through copula filtering and the precise directional timing achieved via Kalman-filter-based momentum. Should the strategy be refined to capture longer term momentum, reducing trading costs; it may be viable to implement.

In the case of extending the hold period, the portfolio is exposed to greater potential for unexpected volatility shocks, which can be observed across all ETF pairs below. This can dismantle the portfolio and leave the investor exposed to margin calls and infinite loss. Thus it is deemed impossible to implement longer holding periods in the United States equities market. Insert Research Chart

Behavioural Rationale

At the core of the Copula-Momentum investment thesis lies the belief that equity ETF markets are heavily influenced by emotionally driven and often irrational investor behavior. This is because ETF transactions are primarily dominated by non-sophisticated investors. Because of this feature of the market, ETFs across industries are often disconnected. This disconnection is driven by investor preferences, biases, and manual trading. The strategy focuses on trading the decision-making behaviors of individuals, rather than the ETF products themselves. The ETFs merely serve as a proxy through which the strategy capitalize on these market dynamics.

This strategy is built on the premise that volatility is a reflection of slow information diffusion, bias, and uncertainty. Volatility shocks often arise from overreactions, underreactions, or misinterpretation of information; especially in the context of sector-specific catalysts. The strategy leverages copula models to pinpoint ETF pairs with minimal probability of concurrent volatility crashes, thereby mitigating exposure to correlation-driven risks within the portfolio. This approach enhances portfolio robustness by systematically reducing the impact of adverse interdependencies between assets. In addition, the integration of a momentum signal, which is generated from Kalman-filtered spread dynamics, provides the ability to seize short-term mispricings.

This approach allows for the systematic identification of emotionally driven divergences in ETF spreads, facilitating the opportunity to benefit from their eventual reversion. Essentially, it capitalizes on behavioral inefficiencies within a rules-based, systematic framework.

Qualitative Methodology Explanation

Explaining the copula-momentum strategy to clients may be difficult. This difficulty is due to the immense technical component related to the strategy. In order to explain how the strategy works from a logical standpoint, this section was created.

The copula-momentum strategy applies a rules based system to investing. The strategy takes advantage of mispriced assets across the United States stock market through relating their uncertainty amongst one another. Two assets are traded at one time, which allows for the strategy to capitalize on the relative mispricing, better known as spread, between them. This mispricing, due to uncertainty, is caused by investor bias and inefficient investing practices. The copula-momentum strategy actively searches for pairs of assets that should not experience this uncertainty at the same time. It does this through a mathematical function called the copula process. This math process creates probabilities that demonstrate how likely two assets are to experience uncertainty at the same time. This acts as the strategies initial screener, which tells the system the two assets to look at investing in. The market shows that there is always a discrepancy in uncertainty; however the copula process demonstrates which one is likely to experience no uncertainty. This allows the copula-momentum strategy to profit from uncertainty when it is unfairly represented.

Momentum signals are the primary decision making tool for whether or not to invest in the pair of assets selected in the copula process. Momentum in finance refers to the tendency of an asset’s price to continue moving in the same direction, either upward or downward, for a period of time. In the copula-momentum strategy, momentum is measured through a mathematical process called the Kalman-filter, which is used to estimate hidden trends or states from rapidly changing data. It continuously adjusts predictions based on new information, making it ideal for filtering out short-term fluctuations while capturing the underlying direction. In the case of the copula-momentum strategy, the difference between the two selected assets is passed through the filter to determine the “true momentum” which doesn’t include fluctuations. This allows the strategy to generate predictor values, which are used in judging whether or not enter positions.

Data Overview

Nine highly liquid U.S. sector ETFs were selected for this strategy:

Ticker Industry
XLK Technology
XLV Health Care
XLF Financials
XLE Energy
XLI Industrials
XLU Utilities
XLY Consumer Discretionary
XLP Consumer Staples
XLB Materials

Daily adjusted closing prices are retrieved from January 1, 2020, to April 8, 2025. The model is trained on data from January 1, 2020 December 31, 2022. The out-of-sample backtest is conducted from January 1, 2023, to the present.

Research & Signal Construction

Crash Risk Estimation via Copulas

During each day within the training window, 30-day changes in volatility across all ETFs are analyzed. These changes are transformed into pseudo-observations through a ranking methodology, which eliminates scale effects and facilitates robust joint modeling. Using a rolling 91-day window, we evaluate every ETF pair under five candidate copula types (Gaussian, t, Clayton, Gumbel, and Frank). Each model is fitted using maximum likelihood estimation (MLE) via the copula:fitCopula() function from R’s package, ensuring the parameters are statistically optimized based on historical data. We then compute the Akaike Information Criterion (AIC) for each fit, selecting the copula with the lowest AIC as the best model for that pair. As a result of speed challenges, a C++ function was implemented with simplified MLE assumptions. This opens the strategy to model risk, however dramatically increases the performance of the system.

For the chosen copula model, the joint crash probability is calculated, representing the likelihood of both ETFs undergoing volatility shocks beyond their respective 5th percentiles. This measure reflects the risk of concurrent emotional reactions between the two assets. The pair with the minimal joint crash probability in volatility is then selected for trading on that day.

Spread Momentum Forecasting

Advanced statistical techniques are employed to analyze financial spreads by estimating momentum and forecasting directional trends. The objective is to extract meaningful signals from noisy data, enabling robust trading strategies and risk modeling. By combining Kalman filtering and Ornstein-Uhlenbeck processes, dynamic momentum estimation and predictive capabilities are achieved. This method is utilized only in the training sample, and the parameters are applied to the backtest period. Implementing the strategy would require the parameters constantly be updated to reflect the market conditions.

The Kalman filter is applied to estimate momentum in 30-day ETF spreads by extracting a smoothed latent trend. The core goal is to distinguish persistent directional movements from short-term noise. The input variable, defined as the log return of the spread, provides a stabilized and scale-invariant representation of price dynamics. For each time step beyond an initial lookback period, a rolling window approach isolates recent observations to estimate the underlying momentum signal.

The Kalman-filter is defined as a univariate state-space model with a local level component. This is formally represented by:

\[ y_t = \mu_t + \epsilon_t \] \[ \mu_{t+1} = \mu_t + \eta_t \] where:

  • (y): Observed log return at time (t)
  • (μ): Latent momentum component
  • (ε): Observation noise (Gaussian disturbance with variance (H))
  • (Nu): Process noise (Gaussian disturbance with variance (Q))

Both variances are unknown and are estimated via maximum likelihood using the Broyden–Fletcher–Goldfarb–Shanno (BFGS) optimization algorithm.

Following estimation, Kalman filtering and smoothing recursions extract filtered state estimates, which serve as momentum signals. The most recent filtered state within each window is recorded as the momentum value for that period. This yields a time series of momentum estimates that dynamically adapt to changes in the underlying distribution of spreads, capturing evolving trends while filtering out fluctuations.

After selecting an ETF pair, the return spread is computed, and the Kalman filter is applied to estimate the latent trend within the spread. The smoothed trend serves as input for fitting an Ornstein-Uhlenbeck process, enabling the estimation of critical parameters. These ETF-spread specific critical parameters are calculated in the model training period, and remain constant for the entirety of the backtest period.

During the backtest, momentum values are forecasted based on the Ornstein-Uhlenbeck (OU) process. Trading signals are generated accordingly:

  • Speed of Reversion: Reflecting how quickly the spread returns to its mean.
  • Long-term Mean: Indicating the average value to which the spread reverts.
  • Volatility: Measuring the degree of fluctuation in the spread

These parameters are calculated for each pair, and are applied during the backtest period.

Pair Theta Mu Sigma
XLB_XLE 50.10 -0.07% 5.91%
XLB_XLF 81.10 0.01% 2.85%
XLB_XLI 131.70 0.00% 3.93%
XLB_XLK 83.96 -0.01% 4.86%
XLB_XLP 113.57 0.00% 6.82%
XLB_XLU 65.51 0.03% 3.70%
XLB_XLV 100.94 -0.01% 4.96%
XLB_XLY 103.86 0.03% 5.40%
XLE_XLF 97.00 0.04% 8.24%
XLE_XLI 96.39 0.03% 8.97%
XLE_XLK 65.62 0.02% 9.30%
XLE_XLP 123.16 0.02% 15.44%
XLE_XLU 110.39 0.04% 13.22%
XLE_XLV 116.61 0.01% 14.25%
XLE_XLY 120.47 0.04% 13.69%
XLF_XLI 75.67 -0.01% 2.24%
XLF_XLK 61.45 -0.02% 4.10%
XLF_XLP 83.30 -0.04% 5.81%
XLF_XLU 47.93 0.00% 4.18%
XLF_XLV 67.71 -0.03% 4.80%
XLF_XLY 62.80 0.01% 3.85%
XLI_XLK 120.95 -0.01% 5.98%
XLI_XLP 117.42 0.00% 6.25%
XLI_XLU 82.84 0.02% 4.27%
XLI_XLV 65.08 -0.02% 4.06%
XLI_XLY 108.95 0.02% 5.07%
XLK_XLP 77.98 -0.01% 3.68%
XLK_XLU 71.70 0.01% 3.68%
XLK_XLV 78.74 -0.01% 3.31%
XLK_XLY 111.31 0.04% 3.54%
XLP_XLU 100.95 0.02% 4.70%
XLP_XLV 132.13 -0.02% 4.52%
XLP_XLY 128.63 0.00% 9.47%
XLU_XLV 67.10 -0.04% 2.84%
XLU_XLY 75.48 0.00% 4.04%
XLV_XLY 75.94 0.02% 4.52%

The OU process was chosen as the method of modeling spread dynamics due to their mean reverting nature. This relationship can be observed below:

Combining Kalman filtering with Ornstein-Uhlenbeck processes provides a dynamic and adaptive framework for momentum estimation and trading signal generation. These techniques allow for precise measurement and forecasting, improving the reliability of spread-based trading strategies while addressing the complexities of financial market behavior

Position Sizing & Execution Rules

Each day a copula process is calculated using a 91 day lookback window. This provides the strategy with the current optimized pair. Utilizing the spread from this pair and the OU parameters, a momentum signal is generated. If the momentum signal is stronger than the previous trade, an order is executed. The system does not require positions in existing spread trades to be closed, prior to a new position being opened.

For the chosen ETF pair:

  • The ETF with expected momentum outperformance is given a long position; in contrast the underperformer is short sold. Each ETF’s weight is inversely proportional to its 30-day volatility.
  • The weights are normalized such that the sum of absolute weights \((|W1| + |W2|) =1\).
  • The position is held for a number of days equal to \(\frac{252}{\theta}\), derived from the mean reversion speed. This is a fixed constraint that does not change given any external factors.

Important Execution Assumption: Daily portfolio returns are computed by averaging the returns of all open trades on each day. This implicitly assumes that positions are rebalanced daily to maintain equal capital exposure, despite weights being computed at trade entry. We acknowledge this as a simplification that ensures consistent portfolio sizing across time; however would result in high transaction costs.

Important Position Assumption: The strategy assumes that the markets remain liquid at all times. Additionally, It assumes that transactions have no price impact and short-selling remains possible for each asset at a fixed annualized cost.

Incorporating Transaction Costs

In calculating strategy returns, an adjustment for transaction costs and short-sale borrowing fees are assumed to remain static.

  • Bid-Ask Spreads: Added for short positions and subtracted for long positions.
  • Short Sale Costs: Charged daily based on ETF-specific annualized borrow rates.

Costs are embedded directly into the adjusted returns prior to calculating daily portfolio profit and loss. The costs associated with each product can be observed here:

ETF Transactions ShortSale
XLK 0.01% 1.00%
XLV 0.35% 1.25%
XLF 0.04% 1.50%
XLE 0.08% 1.50%
XLI 0.04% 1.00%
XLU 0.04% 0.75%
XLY 0.04% 0.75%
XLP 0.02% 0.75%
XLB 0.04% 1.00%

Sensitivity Analysis: Impact of Leverage on Strategy Returns

In the base case, the strategy was implemented with a leverage level of 1 (L = 1), indicating full investment without the use of any leverage. This serves as the benchmark, providing a clear reference point with standard exposure and no borrowing.

\[|w_1|+|w_2| = L = 1\]

This ensures that all capital is deployed with no borrowing, serving as our baseline setup.

To analyze the impact of increased risk, the full trade simulation was conducted using leverage levels ranging from 1 to 4 (L = 1 to L = 4). The weights were adjusted to ensure that the total exposure consistently matched the target leverage. All other parameters, including trades and costs, remained unchanged, with only the position sizes being scaled up.

Strategy vs SPY: Cumulative Returns by Leverage Level

The strategy’s performance is evaluated in comparison to SPY across a range of leverage levels. This assessment highlights how the strategy fares under varying degrees of leverage exposure. The detailed results of this performance comparison are as follows:

Sensitivity Observations

  • The performance of the strategy demonstrates significant scaling as leverage increases. At a leverage level of 4 (L = 4), the strategy achieves substantial outperformance compared to SPY, highlighting its effectiveness under higher levels of risk exposure.
  • Even at leverage levels of 2 or 3 (L = 2 or L = 3), the strategy consistently outperforms SPY. This indicates that achieving a strong advantage does not require excessive leverage, demo
  • Bid-ask spreads and short sale costs were systematically accounted for, yet the returns continued to increase at higher leverage levels. This indicates that transaction costs are not hindering performance, even as leverage rises, underscoring the strategy’s resilience and efficiency in managing associated costs

Optimization Analysis: Impact of Transaction and Short-Sale Costs

In analyzing the impact of transaction and short sale costs on profitability (PnL) across various ETF spreads. The x-axis represents short sale costs, ranging from 0.0% to 2.0%, while the y-axis displays transaction costs, varying from 0.0% to 0.5%. Each heatmap corresponds to a specific ETF pair, using a color scale from red (negative profitability) to blue (positive profitability) to visualize outcomes under different cost conditions.

The results reveal notable differences in the performance of these spreads. Certain pairs, such as XLB-XLF, exhibit resilience, maintaining positive returns (blue zones) across a broader range of cost scenarios. This suggests that these spreads are less sensitive to cost increases and thus more viable under varying conditions. In contrast, spreads like XLB-XLE experience a sharper decline in profitability as costs rise, with red zones dominating under higher transaction and short sale costs. Across all spreads, the optimal conditions for profitability occur when both cost types are minimized, reinforcing the importance of cost control. Ultimately, the results highlight that spreads with greater resilience, such as XLB-XLF, offer strategic advantages, while careful cost management remains critical for maximizing returns. In considering this analysis, it is clear that certain spreads better fit the overall strategy. This is driven by the combined low transaction costs and the low Theta parameter which prolongs how long the position is held on the book for.

Risk Appetite

In defining investors’ risk appetite related to the copula-momentum strategy, the Omega ratio, Sharpe ratio, and maximum drawdown analysis were applied. In completing the analysis, the optimal client is defined as an accredited investor. Accredited investors are those who meet certain income or net worth criteria, allowing them to invest in complex or sophisticated types of securities that are not closely regulated. The nature of the strategy involves short-selling and the use of leverage to balance portfolios.

The risk-adjusted performance of the CopulaMomentum strategy was analyzed using three key metrics: the Sharpe Ratio, the Omega Ratio, and Maximum Drawdown. These metrics provide insights into the strategy’s efficiency in generating returns, its performance relative to downside risk, and its behavior during periods of market stress.

Metric Value
Sharpe Ratio (Annualized) 2.77
Omega Ratio 1.82
Maximum Drawdown 1.53%

Sharpe Ratio

The strategy demonstrates a strong return relative to its volatility, which is a notable achievement given that such high ratios are rarely observed in practice. While this performance suggests the model may be effectively capturing persistent return signals and managing volatility, it is important to account for nuances. For instance, inflated Sharpe Ratios can sometimes arise under extreme market conditions, small sample sizes, or overfitting during backtesting.

Omega Ratio

The Omega Ratio implies that the expected upside returns are more than double the downside risk (relative to a 0% threshold). This is a healthy sign, especially for long/short strategies where traditional metrics like Sharpe may not fully capture skewed distributions.

Max Drawdown

The maximum drawdown reflects the worst observed peak-to-trough decline in strategy equity. This is relatively shallow and suggests strong downside protection and risk controls. The drawdown chart confirms that pullbacks have been brief and contained, with no periods of prolonged distress. However, it should be noted that no single pair trade is ever held for longer than 15 consecutive days, which provides clear evidence that the max drawdown should remain shallow.

The drawdown graph shows that while the strategy does experience frequent small dips, these losses are generally shallow and short-lived. The largest drawdown over the period was just 2.71%, and recoveries tended to follow quickly. There are no long periods of sustained losses, which suggests that the model manages risk well and adapts effectively through its pair selection and rebalancing approach.

Key Assumptions

The model assumes that sector-based ETFs exhibit a level of volatility and momentum that can be measured. Volatility shocks are assumed to occur during moments of market stress or sentiment-driven dislocation, however should not occur in both positions at the same time. The model further assumes that extreme market events, such as margin calls or large sell-offs, follow predictable patterns in response to changes in volatility and sentiment, and these events are factored into the position-sizing mechanism.

The model also assumes that the copula function used for joint risk calculations provides an accurate reflection of the dependency structure between different ETFs across various sectors. For the sake of performance, some components of the copula function are hard coded and assumed to be true. In terms of trading assumptions, the model presumes that transaction costs, including short-sale fees and bid-ask spreads, are fixed. However, it should be noted that in times of extreme market volatility, liquidity constraints could lead to higher transaction costs than anticipated.

Lastly, the model assumes that the relationship between risk and return within sector ETFs holds steady enough for backtesting to accurately represent future performance. While out-of-sample testing is conducted to validate the strategy, the model assumes a reasonable level of consistency in market conditions over time.

Model Learnings

An analysis of the model suggests that the United States equities market may not be ideal for implementing the copula-momentum strategy. This is primarily due to the high price correlation among assets and the significant transaction costs involved. Although the strategy effectively filters momentum using the Kalman filter, the continuous trading model may require refinement. Specifically, positions should meet defined thresholds rather than merely exceeding the strength of the previous signal. In implementing the model, the Kalman filter was an interesting tool to learn, as its effectiveness across a range of tasks is clear.

The copula process proves effective in identifying ETF pairs with low crash probabilities. However, the model’s simplified structure could be expanded to provide a more accurate estimation of the shape and properties of the distributions. This refinement would enhance the model’s robustness and applicability across different market conditions. In implementing the model, understanding the different distributions proved to be a useful challenge which can be implemented in other areas of research.

Overall, while the copula-momentum strategy has demonstrated promise, particularly in ETF pair selection and momentum filtering, its applicability to the U.S. equity market is limited by high correlations and transaction costs. A more dynamic approach to modeling the copula’s dependence structure, combined with improved risk thresholds for trading signals, would enhance its robustness and adaptability to different market conditions.